TSSPlant: a new tool for prediction of plant Pol II promoters
Authors
Abstract
Current knowledge of eukaryotic promoters points to a complex architecture that is often composed of numerous functional motifs. Most known promoters include multiple, and in some cases mutually exclusive, transcription start sites (TSSs). Moreover, TSS selection depends on cell or tissue type, developmental stage and environmental conditions. Such complex promoter structures make their computational identification notoriously difficult. Here, we present TSSPlant, a novel tool that predicts both TATA and TATA-less promoters in sequences from a wide spectrum of plant genomes. The tool was developed using large promoter collections from ppdb and PlantProm DB. It uses eighteen significant compositional and signal features of plant promoter sequences, selected in this study, as inputs to an artificial neural network model trained with the backpropagation algorithm. TSSPlant achieves significantly higher accuracy than the next best promoter prediction program for both TATA promoters (MCC≃0.84 and F1-score≃0.91 versus MCC≃0.51 and F1-score≃0.71) and TATA-less promoters (MCC≃0.80 and F1-score≃0.89 versus MCC≃0.29 and F1-score≃0.50). TSSPlant is available to download as a standalone program at http://www.cbrc.kaust.edu.sa/download/.
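The abstract reports accuracy as the Matthews correlation coefficient (MCC) and the F1-score. As a minimal sketch of how these two measures are computed from a binary confusion matrix, the Python below uses their standard definitions. The function names and the example counts are illustrative assumptions; they are not taken from TSSPlant or from its evaluation data.

```python
import math


def f1_score(tp: int, fp: int, fn: int) -> float:
    """F1-score: harmonic mean of precision and recall, 2TP / (2TP + FP + FN)."""
    denom = 2 * tp + fp + fn
    # Convention assumed here: return 0.0 when the score is undefined.
    return 2 * tp / denom if denom else 0.0


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient for a binary confusion matrix."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention assumed here: return 0.0 when any marginal total is zero.
    return (tp * tn - fp * fn) / denom if denom else 0.0


if __name__ == "__main__":
    # Hypothetical counts for illustration only (not results from the paper).
    tp, tn, fp, fn = 90, 90, 10, 10
    print(f"F1  = {f1_score(tp, fp, fn):.2f}")
    print(f"MCC = {mcc(tp, tn, fp, fn):.2f}")
```

Unlike the F1-score, MCC also takes true negatives into account, which may be one reason the two measures are reported together.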
Similar resources
Genome-wide mapping of RNA Pol-II promoter usage in mouse tissues by ChIP-seq
Chromatin immunoprecipitation (ChIP), using antibody against RNA Pol-II, followed by massive parallel sequencing (ChIP-seq) are invaluable techniques for genome-wide identification of alternative promoters and their patterns of use in different tissues, cell types, and/or developmental stages. However, the identification of promoters cannot be performed solely based on the presence of Pol-II en...
A new algorithm for solving Van der Pol equation based on piecewise spectral Adomian decomposition method
In this article, a new method is introduced to give approximate solution to Van der Pol equation. The proposed method is based on the combination of two different methods, the spectral Adomian decomposition method (SADM) and piecewise method, called the piecewise Adomian decomposition method (PSADM). The numerical results obtained from the proposed method show that this method is an...
Two distinct roles of ARABIDOPSIS HOMOLOG OF TRITHORAX1 (ATX1) at promoters and within transcribed regions of ATX1-regulated genes.
The Arabidopsis thaliana trithorax-like protein, ATX1, shares common structural domains, has similar histone methyltransferase (HMT) activity, and belongs in the same phylogenetic subgroup as its animal counterparts. Most of our knowledge of the role of HMTs in trimethylating lysine 4 of histone H3 (H3K4me3) in transcriptional regulation comes from studies of yeast and mammalian homologs. Littl...
PHD and TFIIS-Like Domains of the Bye1 Transcription Factor Determine Its Multivalent Genomic Distribution
The BYpass of Ess1 (Bye1) protein is a putative S. cerevisiae transcription factor homologous to the human cancer-associated PHF3/DIDO family of proteins. Bye1 contains a Plant Homeodomain (PHD) and a TFIIS-like domain. The Bye1 PHD finger interacts with tri-methylated lysine 4 of histone H3 (H3K4me3) while the TFIIS-like domain binds to RNA polymerase (Pol) II. Here, we investigated the contri...
Transcriptional regulation in plants: the importance of combinatorial control.
Combinatorial control: use of a discrete number of transcription factors in different combinations to give rise to a wide spectrum of expression patterns. Enhanceosome: a higher-order nucleoprotein complex that is formed by the binding of a specific combination of transcription factors to the transcriptional regulatory sequences of a particular gene. General transcription factors: components of...
Journal title:
Volume 45, issue
Pages -
Publication date 2017